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Kaltofen has proposed a new approach in [8] for computing matrix determinants. 

\j2c< The algorithm is based on a baby steps/giant steps construction of Krylov subspaces, 

^^ ' and computes the determinant as the constant term of a characteristic polynomial. 

f . For matrices over an abstract field and by the results of Baur and Strassen [I], the 

determinant algorithm, actually a straight-line program, leads to an algorithm with 
r \ . the same complexity for computing the adjoint of a matrix [8]. However, the latter 

is obtained by the reverse mode of automatic differentiation and somehow is not 

"explicit" . We study this adjoint algorithm, show how it can be implemented (without 
q resorting to an automatic transformation), and demonstrate its use on polynomial 

matrices. 
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Kaltofen has proposed in [5] a new approach for computing matrix determinants. This approach has 
brought breakthrough ideas for improving the complexity estimate for the problem of computing the de- 
terminant without divisions over an abstract ring [8l lllj . The same ideas also lead to the currently best 
known bit complexity estimates for some problems on integer matrices such as the problem of computing 
■^+ , the characteristic polynomial [TTJ. 

We consider the straigth-line programs of [S] for computing the determinant over abstract fields or rings 
(with or without divisions). Using the reverse mode of automatic differentiation (see [TH [T3J [TJ] ) ; a straight- 
line program for computing the determinant of a matrix A can be (automatically) transformed into a program 
for computing the adjoint matrix A* of A [1] (see the application in [H §1.2] and [HJ Theorem5.1]). Since 
the latter program is derived by an automatic process, few is known about the way it computes the adjoint. 
The only available information seems to be the determinant program itself and the knowledge we have on the 
differentiation process. In this paper we study the adjoint programs that would be automatically generated 
by differentiation from Kaltofen's determinant programs. We show how they can be implemented with and 
without divisions, and study their behaviour on univariate polynomial matrices. 

Our motivation for studying the differentiation and resulting adjoint algorithms is the importance of the 
determinant approach of [8| lllj for various complexity estimates. Recent advances around the determinant 
of polynomial or integer matrices [5[ 1 1 1L 1151 I16j , and the adjoint of a univariate polynomial matrix in the 
generic case [7], also justify the study of the general adjoint problem. 

1 Kaltofen's determinant algorithm 

Let K be a commutative field. We consider A £ K nxn , u £ K nxl , and v £ K nxl . Kaltofen's approach extends 
the Krylov-based methods of [HI [10]. We introduce the Hankel matrix H = (uA l+ J~ 2 v)ij £ K nxn , and 
let hk = uA k v for < k < 2n — 1. We assume that H is non-singular. In the applications the latter is 
ensured either by construction of A, u, and v [8llllj. or by randomization (see |llj and references therein). 



With baby steps/giant steps parameters r = \2n/s] and s — \\/n\ (rs > 2n) we consider the following 
algorithm (the algorithm without divisions will be described in Section [3]). 



Algorithm Det [H] 

step 1. For i = 0, 1, . 
step 2. B = A r ; 
step 3. For j = 0, 1, . 
step 4. For i = 0, 1, . 
For j = 0,1,. 

step 5. Compute the minimum polynomial /(A) of the sequence {hk}o<k<2n-i] 
Return /(0). 



, r — 1 Do Vi :— A l v; 

, s — 1 Do Uj := uB J ; 

, r - 1 Do 

, s — 1 Do /i-j+jr := %'< 



2 The ajoint algorithm 

The determinant of A is a polynomial in K[an, . . . , ay, . . . , a nn ] of the entries of A. If we denote the adjoint 
matrix by A* such that AA* — A* A = (det A)I, then the entries of A* satisfy pQ: 
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-,1 < i,j < n. 



(1) 



The reverse mode of automatic differentiation (see [TJ [T2l [131 HI]) allows to transform a program which 
computes A into a program which computes all the partial derivatives in |T]) . We apply the transformation 
process to Algorithm Det. 

The flow of computation for the adjoint is reversed compared to the flow of Algorithm Det. Hence 
we start with the differentiation of Step 5. Consider the n x n Hankel matrices H — (uA l+: '~ 2 v)ij and 
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{uA 
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y. Then the determinant /(0) is computed as 
A = (detH A )/{dctH). 
Viewing A as a function A5 of the ftfe's, we show that 



dA 5 
dh k 



= ( ¥3fe _ 1 ( J ff^ 1 )-^ fc ( J ff- 1 ))A 



(2) 



where for a matrix M = (my) we define tpk(M) =0 + ^2, 



+j-2=k 



for 1 < k < 2n- 1. Identity (|2]) gives the 



first step of the adjoint algorithm. Over an abstract field, and using intermediate data from Algorithm Det, 
its costs is essentially the cost of a Hankel matrix inversion. 

For differentiating Step 4, A is seen as a function A4 of the Vi's and Uj's. The entries of Vi are involved 
in the computation of the s scalars hi, hi+ r , . . . , hi+u-xy. The entries of Uj are used for computing the r 
scalars hj r , hi+j r , ■ ■ ■ , /i( r _i)+j r . Let dvi be the lxn vector, respectively the nxl vector duj, whose entries 
are the derivatives of A4 with respect to the entries of Vi , respectively Uj . We show that 



dv 
dvi 



dv 



r-l 



= H V 
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U s -1 



and 



[ du Q , dui, . . . du s ^x ] = [ v , «i, . . . v r -i ] H u 



(3) 



(4) 



where H v and H u are r x s matrices whose entries are selected dA 5 /dhkS. Identities $5§ and (g]) give the 
second step of the adjoint algorithm. Its costs is essentially the cost of two n x y/n by s/n x y/n (unstructured) 
matrix products. 

Note that ^, ^ and (g| somehow call to mind the matrix factorizations [21 (3.5)] (our objectives are 
similar to Eberly's ones) and [H (3.1)]. 

Steps 3-1 of Det may then be differentiated. For differentiating Step 3 we recursively compute an n x n 
matrix dB from the Suj'a. The matrix dB gives the derivatives of A3 (the determinant seen as a function 
of B and the Wj's) with respect to the entries of B. 

For Step 2 we recursively compute from SB an n x n matrix 8 A that gives the derivatives of A2 (the 
determinant seen as a function of fj's). 

Then the differentiation of Step 1 computes from 8 A and the Svi's an update of 6 A that gives the 
derivatives of Ai = A. From (P) we know that A* = (SA) T . 

The recursive process for differentiating Step 3 to Step 1 may be written in terms of the differentiation 
of the basic operation (or its transposed operation) 

q:=pxM (5) 

where p and q are row vectors of dimension n and M is an n x n matrix. We assume at this point (recursive 
process) that column vectors Sp and Sq of derivatives with respect to the entries of p and q are available. We 
also assume that annxn matrix SM that gives the derivatives with respect to the m^'s has been computed. 
We show that differentiating ([5]) amounts to updating Sp and SM as follows: 



Sp := Sp + M x Sq, 
SM :=SM +p T x (Sq) 



.:/■ . ,,;,,./ ( 6 ) 



We see that the complexity is essentially preserved between ^ and ^ and corresponds to a matrix by 
vector product. In particular, if Step 2 of Algorithm Det is implemented in O(logr) matrix products, then 
Step 2 differentiation will cost 0(n 3 logr) operations (by decomposing the 0(n 3 ) matrix product). 

Let us call Adjoint the algorithm just described for computing A*. 

3 Application to computing the adjoint without divisions 

Now let A be an n x n matrix over an abstract ring R. Kaltofen's method for computing the determinant of A 
without divisions applies Algorithm Det on a well chosen univariate polynomial matrix Z(z) = C + z(A — C) 
where C € Z" x ™. The choice of C as well as a dedicated choice for the projections u and v allow the use of 
Strassen's general method of avoiding divisions [T71 [5]. The determinant is a polynomial A of degree n, the 
arithmetic operations in Det are replaced by operations on power series modulo z n+1 . Once the determinant 
of Z(z) is computed, (det Z)(l) = det(C + 1 x (A — C)) gives the determinant of A. 

In Step 1 and Step 2 in Algorithm Det applied to Z(z) the matrix entries are actually polynomials of 
degree at most i/n. This is a key point for reducing the overall complexity estimate of the problem. Since 
the adjoint algorithm has a reversed flow, this key point does not seem to be relevant for Adjoint. For com- 
puting det A without divisions, Kaltofen's algorithm goes through the computation of det Z(z). Adjoint 
applied to Z(z) computes A* but does not seem to compute Z*(z) with the same complexity. In particular, 
differentiation of Step 3 using ([6]) leads to products A l (5B) T that are more expensive over power series (one 
computes A(z) l (5B(z)) T ) than the initial computation in Det A r (A(z) r on series). 

For computing A* without divisions only Z*(l) needs to be computed. We extend algorithm Adjoint 
with input Z(z) by evaluating polynomials (truncated power series) partially. With a final evaluation at 
z = 1 in mind, a polynomial p(z) = po + p\z + . . . + p n _iz™ _1 + p n z n may typically be replaced by 



(po + Pi + • ■ • + Pm) + Pm+\X m + ■ ■ • + Pn-\z n 1 + p n z n as soon as any subsequent use of p(z) will not 
require its coefficients of degree less than m. 

4 Fast matrix product and application to polynomial matrices 

We show how to integrate asymptotically fast matrix products in Algorithm Ajoint. On univariate poly- 
nomial matrices A(z) with power series operations modulo z n , Algorithm Adjoint leads to intermediary 
square matrix products where one of the operand has a degree much smaller than the other. In this case we 
show how to use fast rectangular matrix products [2j [6] for a (tiny) improvement of the complexity estimate 
of general polynomial matrix inversion. 

Concluding remarks 

Our understanding of the differentiation of Kaltofen's determinant algorithm has to be improved. We have 
proposed an implementation whose mathematical explanation remains to be given. Our work also has to be 
generalized to the block algorithm of [TT] . 

Acknoledgements. We thank Erich Kaltofen who has brought reference [2] to our attention. 
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